Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
La | 6141 | 474 | 6 | 79.0000 |
En | 2929 | 157 | 2 | 78.5000 |
Et | 1836 | 69 | 1 | 69.0000 |
Il | 4519 | 157 | 3 | 52.3333 |
Les | 5616 | 440 | 9 | 48.8889 |
Cette | 1208 | 92 | 2 | 46.0000 |
Le | 9127 | 592 | 17 | 34.8235 |
Comme | 452 | 34 | 1 | 34.0000 |
Mais | 2463 | 64 | 2 | 32.0000 |
Ce | 1930 | 96 | 3 | 32.0000 |
Pour | 1952 | 90 | 3 | 30.0000 |
Un | 1726 | 139 | 5 | 27.8000 |
On | 1627 | 82 | 3 | 27.3333 |
Elle | 1065 | 49 | 2 | 24.5000 |
C'est | 1641 | 73 | 3 | 24.3333 |
À | 313 | 24 | 1 | 24.0000 |
Après | 730 | 23 | 1 | 23.0000 |
Au | 990 | 69 | 3 | 23.0000 |
Une | 1626 | 128 | 6 | 21.3333 |
Ils | 853 | 40 | 2 | 20.0000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
milliards | 1099 | 6 | 83 | 0.0723 |
ton | 163 | 1 | 10 | 0.1000 |
types | 71 | 1 | 9 | 0.1111 |
capable | 142 | 1 | 9 | 0.1111 |
envie | 133 | 2 | 15 | 0.1333 |
dame | 56 | 1 | 7 | 0.1429 |
désir | 62 | 1 | 7 | 0.1429 |
souci | 71 | 1 | 7 | 0.1429 |
centaines | 167 | 1 | 7 | 0.1429 |
studio | 86 | 1 | 7 | 0.1429 |
lors | 1289 | 7 | 45 | 0.1556 |
2008 | 1494 | 18 | 108 | 0.1667 |
tenter | 148 | 1 | 6 | 0.1667 |
pétrolier | 54 | 1 | 6 | 0.1667 |
Barack | 129 | 1 | 6 | 0.1667 |
pôle | 64 | 1 | 6 | 0.1667 |
télécoms | 48 | 1 | 6 | 0.1667 |
étoiles | 32 | 1 | 6 | 0.1667 |
forum | 53 | 1 | 6 | 0.1667 |
réelle | 75 | 1 | 6 | 0.1667 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II